Inducing grammar from IGT
Authors
Abstract
We suggest a strategy for the incremental construction of deep parsing grammars from Interlinear Glossed Text (IGT). IGT is a representation format in which standard linguistics and NLP in principle meet, since it is a data type that is often available for digitally ‘less-resourced languages’ (‘LRL’). The IGT database is TypeCraft (Beermann and Mihaylov 2009, www.typecraft.org), and the grammar technology employed so far is that defined in the LKB system (Copestake 2002), an implementation of the HPSG grammar
Similar resources
From IGT to precision grammar: French verbal morphology
Interlinear glossed text (IGT, the familiar three-line format of linguistic examples) can be an extremely rich source of linguistic information, when linguists follow best practices in creating it (e.g., the Leipzig glossing rules, Comrie et al. 2003). The ODIN project (http://www.csufresno.edu/odin; Lewis 2006) recognized the value of IGT data as a reusable data type and has created a searchab...
Learning Grammar Specifications from IGT: A Case Study of Chintang
We present a case study of the methodology of using information extracted from interlinear glossed text (IGT) to create actual working HPSG grammar fragments using the Grammar Matrix, focusing on one language: Chintang. Though the results are barely measurable in terms of coverage over running text, they nonetheless provide a proof of concept. Our experience report reflects on the ways in whi...
Towards Creating Precision Grammars from Interlinear Glossed Text: Inferring Large-Scale Typological Properties
We propose to bring together two kinds of linguistic resources—interlinear glossed text (IGT) and a language-independent precision grammar resource—to automatically create precision grammars in the context of language documentation. This paper takes the first steps in that direction by extracting major-constituent word order and case system properties from IGT for a diverse sample of languages.
Increased adiponectin receptor-1 expression in adipose tissue of impaired glucose-tolerant obese subjects during weight loss.
OBJECTIVE: To investigate the mRNA expression of adiponectin, AdipoR1 and AdipoR2, the two recently cloned adiponectin receptors and peroxisome proliferator activated receptor (PPAR)gamma2 in adipose tissue of obese individuals before and during a very low calorie diet (VLCD) inducing weight loss. METHODS: Twenty-three non-diabetic obese subjects with normal (NGT, n = 11) or impaired glucose to...
Language CoLLAGE: Grammatical Description with the LinGO Grammar Matrix
Language CoLLAGE is a collection of grammatical descriptions developed in the context of a grammar engineering graduate course with the LinGO Grammar Matrix. These grammatical descriptions include testsuites in well-formed interlinear glossed text (IGT) format, high-level grammatical characterizations called ‘choices files’, HPSG grammar fragments (capable of parsing and generation), and docume...